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The distributions of the smaUest and largest eigenvalues for the matrix product Z^Z, where 
Z is an n X TO complex Gaussian matrix with correlations both along rows and down columns, 
are expressed as to x m determinants. In the case of correlation along rows, these expres- 
sions are computationally more efficient than those involving sums over partitions and Schur 
polynomials reported recently for the same distributions. 



1 Introduction 

A typical setting in multivariate statistics is to measure each of to variables xi, . . . , Xm a total of N times. 
For example, the variable Xk may denote the wind speed at weather station fc at a specific time of day; 
recording the value on successive days gives a sequence of values x): , j — 1, . . . , iV for the variable Xfc, 
which forms a column vector Xk — [x^i^]j=i,...,N ■ Collecting together the column vectors for each of the 
variables Xk gives the data matrix X — [xk\k=i,...,m- Let the average of the readings of variable Xk be 
denoted Xk, so that 

1 ^ 

i=i 

Let Xk — [xk]j=i,...,N be the corresponding (constant) column vector, and set X :— [xk\k=i,...,m- Forming 
now 



n 



1 1 

Ux - XfiX -X)^[l ^(4f - 4f - 4f ) 



^1,^2 — 1,- . .,m 



n = N—\, gives an empirical approximation to the covariance matrix [{{xki—{xk-^)){xk:i — {xk2)y\ki,k2='^,---,rn 
for the variables {xk}k=i,...,m- 

Analytic studies of the matrix A can be carried out in the case that the variables xi, . . . , Xm relating 
to the data matrix X are chosen from a multivariate Gaussian distribution with variance matrix S and 
mean fl. Then it is known (see e.g. 0) that the distribution of A is the same as that for the matrix 
product Y'^Y , where K is an m x n, n — N — 1, Gaussian matrix in which each row is drawn from a 
multivariate Gaussian distribution with covariance matrix S and mean zero. Thus the joint probability 
density function (p.d.f.) of the elements of Y is 

1 -Tr(S-y-y/2)^ (1.1) 

where here and throughout (unless otherwise stated) C represents some constant (i.e. quantity indepen- 
dent of the main variables of the equation, which here are the elements of Y). 



A standard practice in studying the empirical covariance matrix is to form the eigenvalue-eigenvector 
decomposition. This comes under the name of principal component analysis (see e.g. [lip. On a the- 
oretical front one seeks analytic forms for eigenvalue distributions of the matrix A = Y^Y when Y is 
distributed according to In fact the eigenvalue p.d.f. can be written down in terms of a multi- 

variable generalized hypergeometric function based on zonal polynomials (see e.g. This function 

is inherently difficult to compute, but there have been some recent advances j^. It is also possible 
to integrate over this p.d.f. to express the distribution of the largest eigenvalue as another generalized 
hypergeometric function |1(J| . 

In a recent work |15| a study of the p.d.f. for the smallest and largest eigenvalues of the matrix 
A = Z for Z a.n n X m, [n > m) complex Gaussian matrix with p.d.f. 

has been undertaken. Earlier studies had considered these p.d.f. 's in the case S = / [SIEI. The setting 
of complex data matrices is of great importance in recent quantitative studies of wireless communication 
(see e.g. ^1 A significant feature of the p.d.f. (|1.2|l is that the corresponding joint eigenvalue 

p.d.f. of A can be written as a determinant [SmElEI- Moreover, as to be shown in Section 2 below, the 
marginal distributions by way of the p.d.f. of the smallest and largest eigenvalues can also be evaluated 
as determinants. In contrast, these same distributions where evaluated in [15} as a sum over Schur 
polynomials and as a generalized hypergeometric function based on the Schur polynomials respectively 
(see and below). 

Suppose more generally that the complex data matrix Z has p.d.f. 

ie"Tr(s-zts-z)^ (1.3) 

Here S2 can be interpreted as the covariance coupling the measurements of a single variable z^. Very 
recently |16lll7j . it has been shown that for this distribution the canonical average 

det{l + uZ^ Z)P) (1.4) 



can be expressed as an n x n determinant, even though the joint eigenvalue p.d.f. of l|1.2|l cannot itself 
be written in a determinant form. In Section 3 we will use the method of IT to similarly express the 
p.d.f. for the smallest and largest eigenvalues of Z^ Z with Z distibuted as p.3|l in terms of determinants. 

2 Case of a single covariance matrix 
2.1 Correlation across rows of Z 

Consider the p.d.f. H1.2|l . Introduce the singular value decomposition 

Z = C/diag(/ii,...,/i„)y, (2.1) 

where U (V) is a to x m In x n) unitary matrix and the /i| —: Xj are the eigenvalues of the positive 
definite matrix Z^ Z. 

We seek the joint distribution of the {\j}j=i^...,m, p{M, ■ ■ ■ , A„i) say. Firstly, with A = Z'' Z, we know 



2 



(see e.g. 0) 



(dA) = -dct(Z^Z)"-'"(dZ) 



c 



(2.2) 



l<j<fe<m 



where (V'^dV) is the Haar measure (uniform distribution) on the space of 7n x rn unitary matrices U{m). 
Thus 

1 r 
p(Ai,...,A™) = inAr" n i^k-^^f ,-Tri^-^v'^-^.,iM,.-x^)v)^vUV). (2.3) 



l<j<k<m 



This is the well known Harish-Chandra/Itzykson-Zuber matrix integral (see e.g. 53)- has a closed 
form determinantal evaluation, which when substituted in H2.3(l implies 



p(Ai, . . . , A„) = i n n 7^^3TT det[e-^^^-],,,.i, 



(2.4) 



j = l l<j<fc<m 

where {si, . . . , Sm} are the eigenvalues of E^^. As referenced in the third sentence below 1)1.2(1 . the result 
((2.4|l has been made explicit in a number of recent works. 

Consider now the probability E{{X, oo)) that the interval (A, oo) is free of eigenvalues. This is related 
to the p.d.f. of the largest eigenvalue, p™'^^{X) say, by 



We have 
i?((A,^)) 



dXi 



p'"^'^(A) = -^i?((A,oo)). 



rfA,„p(Ai, . . . ,\rn) 



^ / dAi . . . / dX,n n Ar™ n(^fe - A,) det[e-^^^'=],,,=i, 



c 

Because both factors in the integrand are anti-symmetric in {Xj}j=i^,,,^m, and 

m 

n(Afe-A,)-AsymA;A2---A:;:-\ 

j<k 

the product can be replaced by X'(X2 ■ ■ ■ A™~^ provided we multiply by ml. Doing this we see the inte- 
grations over {Afc} can be performed column by column, to give 



E{{X,oo)) 



1 



■det 



l,...,m 



(2.5) 



To evaluate C, we note limA^oo E{{X, oo)) = 1. The integral in 1)2. 5|l can be evaluated in this limit to 
give 



1 = 



1 



det 



s [n — m + K — 1)\ 



j.k—l....,rn 



Factoring the factorials from the determinant and then making use of the Vandermonde determinant 
formula shows 

7n m 

C = (-l)™("-i)/2m! ]J(n - m + /c - 1)! J]^ sJ". 

fe=i j=i 



(2.6) 
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Substituting this in l|2.5|l . and changing variables Sj ^ Xsj in the integral therein, we obtain for our final 
expression 



E{{X,oo)) = 



1 



det 



jjfc— l,...,m 



(2.7) 



We remark that in the case that Sj = f (j — r + 1, . . . , m), the m —>■ oo limit of E{{X, oo)), with 
A, si, . . . , Sr appropriately scaled, is studied in p. We remark too that in ^| Corollary 3.3] E{{X, ooj) 
is expressed in terms of the generalized multi- variable hypergeometric function 



iFi(a,5;xi, . . . := ^ J°|^ Sf,{xi, . . . ,a:„). 

In (|2.8|l denotes the Schur polynomial labelled by a partition k — . . . , Km), '^i > ■ ■ • > Kti 

r(a - j + 1 + Kj) 



(2.8) 



n 



L r(a-j + l) 



while 



ml 



Im{n) := n 

1 <i<j <m 



(j - t + Kj - Kj) 

i - j 



Thus from TB", Eq. (3.5)] 



r(n + A:) 
fc=i ^ ^ j=i 

Comparing (j2.9|l and 1)2.5(1 gives the determinant formula 

T{n + k) 1 
jL^i r(A:)r(n - m + fc) nj<fc(2;/c - Xj) 



-Xsm)- 



(2.9) 



Fi(n;n + m;a:;i,...,x„i) = J]^ 



det 



n — m+A; — 1 Xjt 



jjfc— l,...,m 

(2.10) 

The integral in (|2.7(l is itself a special case of a one variable confluent hypergeometric function iFi, 
allowing us to write 



i?((A,oo)) 



1 



n™i(As.)'^ 



nr=i(" - m + fc)! nKfe(~^)('''= ~ 



X det 



i-F'i(ri — r7i + A;;n — TO + fc + l; — Asj) 



l,...,m 



(2.11) 



Note that H2.11|l and (|2.9|l are identical in the case m = 1. 

We draw attention to a limiting feature of ((2.5|l which is of relevance in the study of E{{X, oo)) for fully 
correlated matrices ()1.3|l . Suppose then that n = m in 1(2. 5|l . and consider the limit s„ ^ cxd. Integrating 
the final row of integrals by parts, we see that the dominant term is that in the first column. Expanding 
by this term shows 



lim ^;((A,oo)) 



EiiX,oo)) 



m—n—l 



and iterating this we have 



lim E{{X,oo)) =E{{X,co)) 



.s„ — >oo 



where on the right hand side E{{X, oo)) is for the general m case, as given by ((2.7(1 . The understanding of 
this result is that with s„_„i, . . . , s„ oo, the final n — m rows of Z become zero and so the eigenvalues 
of Z arc those of W^W for W the restriction of Z to its first m rows, together with m zero eigenvalues. 
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For the probability E{{0, A)) that the interval (0, A) is free of eigenvalues, related to the p.d.f. of the 
smallest eigenvalue, p™™(A) say, by 



we have 

E{{0,X)) / dAi 



P"""(A) = ^i?((0,A)), 



d\„ip{\i, . . . , Am) 



-| -1 />oo ^ 



c 

Proceeding now as in the derivation of (|2.7() shows 

m ! 1 



i?((0,A)) = 



to! e~'^^^=i*^ 



det 



j.n— m+A; — 1 — Sji 



j,A;— l,...,m 



■det 



(i + A) 



j./c— l,...,m 



(2.12) 



where C is given by (|2.ti|) . Note that for m — n the determinant only contributes a constant (i.e. term 
independent of A) and we have 

£:((0,A)) =e-^^^=l"^ (2.13) 

m— n 

which generalizes the same result known for si = • • • = s„ = 1 ^ 
In terms of the confluent hypergeometric function 

1 



U{a, b, z) := 



T{a) 



6-a-l 



dt. 



which unlike ii*i(a, b; z) is singular at z = 0, but like iFi is available as an inbuilt function on a number 
of mathematical computing packages, (|2.12(l reads 



X det 

The computationally more complex evaluation 



^^J-^ T{n~m + k) nj<fe(sfe - Sj) 
U{l,n-m + k + l, \sj)]j^k=i,...,7n- 



m(n — m) 



i?((0,A))=e-^S"^^^ E E 



(-^1 •)••••! ^m) 



(2.14) 



(2.15) 



fc=0 



K:|K| = fc 



is given in TF, Eq. (3.8)]. 

Also given in |15j is an expression involving sums over partitions and Schur polynomials for the 
probability i?((0,a) U (6, oo)) that there are no eigenvalues in either of the intervals (0, a) or (6, cxo). In 
terms of this quantity the joint p.d.f. for the smallest and largest eigenvalues, p(a, b) say, is given by 

92 



p{a,b) 



dadb 



E{{0,a) U (6,00)). 



From p{a, b) one can deduce the distribution of b/a, which is the square of the condition number and so 
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is of relevance in numerical analysis. The method of derivation of (|2.7|l yields the determinant evaluation 

E{{0,a) U (6,00)) 

= T^rFw A / rf^i • • • / dXra n n(^^- - A,)det[e-^^^'=],,fe.i,...,™ 

'-^ lljKkV'^k - Sj) J a J a 

where C is given by H2.6I) . 



(2.16) 



2.2 Correlation down columns of Z 



Consider next the case that Z has distribution 



f lfO)l with El = /). Now 



g-Tr(zts-iz) ^ g-Tr(S-iZZ+) 



(2.17) 



(2.18) 



and the non-zero eigenvalues of ZZ^ agree with the eigenvalues of Z^ Z ^ which we again denote {Xj} j=i....^m- 
With A = ZZ^ H2.2|) again apphes but with V replaced by U (recall H2.1|l ). and thus 



p(Ai, . . . , Am) 



l<j<fc<m 



lim 



-Tr(E2"Vdiag(Ai,...,A„)(7+) 



(C/^dC/). 



(2.19) 



(Here the limit could have been taken immediately, but as noted in |17| there are computational advan- 
tages in delaying this step.) Proceeding as in the derivation of (|2.4II gives 



p(Ai,...,A,„) = — lim 



det[e-^^-^'=],-fc=i, 



C x^+u-.K^Q n"<fc(Afe - Aj)(sfe - Sj) 
where here {si, . . . , s„} denotes the eigenvalues of S^^. Now taking the limit this reads 



- . i nr<.(A..-A,) 

p(Ai ,...,Xm) = — — : r det 



[e 



fc=i,. 



(2.20) 



The joint p.d.f. H2.20|l has been derived previously in pi I17|. 

The derivations of (|2.7|l and 12.12|l can be applied to 1)2. 20|) to deduce determinant formulas for 
£;((A,oo)) and E{{Q,X)). Thus we find 



and 



where 



£;((A,oo)) = 



EmX)) = 



det 



■ det 



A; = l,. 



■' k = l,. . . 



m n 



(2.21) 

(2.22) 
(2.23) 



fe=i j=i 
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3 Fully correlated case 



We turn our attention now to the case in which the data matrix Z is correlated both across 

rows and down columns. Here it does not appear possible to write the joint eigenvalue p.d.f. of Z'^ Z in 
determinant form. Nonetheless, it has been shown recently by Simon and Moustakos (see ^7] for 
a detailed presentation) that it is possible to give a determinant formula for the average H1.4|l . Here we 
will show that their calculation can be adopted to give determinant formulas for E{{X, oo)) and E{{0, A)) 
(the latter being restricted to the case m = n). 

To begin we suppose m — n. In the case of £^((A, oo)), by using the limiting procedure discussed in 
the paragraph below (|2.10() . a formula can be deduced from this for general m < n. Our starting point 
is the formula [T7j 

n n 

p(Ai, . . . , A„) = l[{r,s,rl[{Xk - A,)J({r,}, {s,}, {A,}) (3.1) 

3=1 j<k 

where, with the eigenvalues of Sj~^, given by {r^}, {si} respectively, 

Z({r,},{sJ,{A,J) 

= 1 V f] ^"^^'^ det[r^^']det[A^^']det[.^^'] 

In |17l Lemma 5] it is proved that J is bounded by an exponentially decaying function in Xj for each 
j = l,...,n. 

Consider first E{{0,X)). We thus seek to integrate each Xi in H3.1|l over {X,oo). For this we take 
inspiration from |17| and note from the Vandermonde determinant evaluation that 



A. - A\"-i 



A, 



X, - X 



j,k—l,...,n 



Expanding out the determinant according to its definition, substituting in and integrating gives 



J'^ nOO p 



dX 



N 



j=l PeSn 



X n(^^- - A)"-^(^' Af ^")-^X({r J, {A.}). (3.4) 
For each integration variable, we integrate by parts P{j) — 1 times, making use of the simple formula 

For m > — 1, {Xj — A)'"+^ vanishes at Xj — X (this is part of the motivation for the manipulation (|3.3() '). 
while from the remark below (|3.2|l the factor involving T vanishes at Aj = oo. Hence in the integration 
by parts there is no contribution from the end points. We must compute the partial derivatives with 
respect to Xj of aJ''"''^^!. For this note that the only term dependent of Xj in X is det[A*^'], and 

n 

n Af ^")-Met[A^^'] = det[A^^'+^(^")-^]. 
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Performing P{j) — 1 integration by parts in each variable Xj thus gives 

" \-ri(ri-l)/2 

j=i Pes 



pec p 



fei>--->/c„>0 j=l 



(-l)fc^ 



N 



dei[rf] det [h^'^ ( - ^) A,^'] detj^' 
]Tj<iiki-kj){ri-rj){si~ Sj) 



(3.5) 



Next we want to integrate row by row in the determinant. Although the integrand decays exponen- 
tially at infinity, this gives divergent integrals, as a result of interchanging the order of summation and 
integration. To overcome this, write 



/■oo pL 

I dXj = lim / dXj [j — 1 , . . . , n) , 



and so interchange only the finite range integrals with the summation of {/ij}. We see the resulting one 
dimensional integrals are the same down each column of the determinant and so can be factored. Further- 
more, the sum over P G Sn then simply interchanges rows in the determinant, which is compenstated 
for by £(P), thus contributing an overall factor of n\. Hence 



£;((0,A)) 



n(....rA""("-)/^£m^ E n 



("1)* 



det [r^'] det [4'] 



fel>--->*:n>Oi 



1=1 



Wj [t- A)""H'=' dt det [ W 

p=i 





+ P\\ 


V n 


-p)\ 



j,l = l,...,n 



L^i % ! n"</ - kj ) {n -rj){si- ) 

(3.6) 



As noted in ^7], it is straightforward to verify that 
det[n(-^)] 

L V 71 — p / J 7"./ = l,...,Tl 

thus cancelling YYj^iih — kj) and reducing H3.6|l to 
E{iO,X)) 

m—n ~ 

i=i 



(-1) 



n — 1 ^ n 

i=i j<i 



■ft ft — ± - 

i=i fci>-fc„>o 
^ _ii-t)^^\ det[r|']det[4-'] 



The lattice version of the well known Heine formula from random matrix theory (see e.g. [S]), 
d^J-ixi) ■ ■■ J dfi{xN) det[(t>j{xk)]j,k=i,...,Ndet[i)j{xk)]j^k=i,...,N 
= Nldet[J (j)j{x)'ipkix) dfj,{x)] 



(3.7) 



\j,k=l,...,N 



namely 



J2 det[a^]det[&,'^]nu;(fc,) = det [J2^ip)i^ 

ki>->kn>0 i=l p=0 



aibj) 



i J — l,...,n 
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referred to in as the Cauchy-Binet formula, allows the sum in 1)3. 7|l to be computed. Taking then the 
limit L — > cx) gives the sought determinant formula 



i?((0,A)) 



3 = 1 3 = 1 



1 



X det 



[ / [t- A)"-ie-*''^'^' dt] 



And changing variables 1 1-^ t + A in the integral allows H3.8f) to be simplified further, giving 



i?((0,A)) 



n 



1 



=" /i/'n;<,(-A)(r,-r,)(.,-.,) 



Jj,Z=l,...,n- 



(3.8) 



(3.9) 



Curiously this is the Harish-Chandra/Itzykson-Zuber matrix integral evaluation used in going from H2.3|) 
to (|2.4|l and so we have the matrix integral representation 



£;((0,A)) 



,-Tr(Afly + Sy) 



(3.10) 



where R,S are Hermitian matrices has eigenvalues {ri}, {si} respectively, and [V^^dV^] denotes the nor- 
malized Haar measure, ^[VUV] = 1. With TO = n, in the limit si, . . . , s„ 1 the p.d.f. I|1.3(l reduces to 
(|1.2|l . In keeping with this we can check that H3.9|l reduces to (I2.13|) (with Sj ^ rj {j — 1, . . . ,n) in the 
latter). 

We turn our attention now to E{(X, oo). For this we use a minor rewrite of (|3.3(l 



A- Aa"-i 



j<k 3 = 1 

in H3.1|) so that the analogue of H3.4|l reads 



Ai/2 



det 



A- 



A- A. 



i?((A,oo)) 



n(r,.,)"A-"("-i)/^ y: e{p) / dx,--- 

3=1 Pes„ ^0 ^" 

n 

X - A,)"-^(-'")Af (^^-^X({rJ, {sJ, {AJ). 



dA 



N 



3 = 1 

The procedure of going from H3.4|l to H3.8|l can now be enacted. (Note that in the integration by parts 
the factor 

n 

n(A-A,)"-^(-')Af-'')-^ 

3 = 1 

ensures that the integrand vanishes at the end points.) We thus arrive at the determinant evaluation 



^((A,oo)) 



1 n;ii(Ar,^s,-)" 



n 



det 



j,l=l,...,n 



(3.11) 



Analogous to the remark in the paragraph below (|3.10() . we must have that for si, . . . , s„ 1 (|3.11() 
coincides with (|2.7(l (after setting Sj ^ Vj {j — \, . . . ,n) hi the latter). Now, taking the limit si, . . . , s„ 
1 in 1)3.11(1 gives 



1 i\:uiM3Y 



(_}__\ i_ 



z(-A)(ri-r,) 



det 



j.k—l....,n 



(3.12) 
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Expanding the derivatives using the product rule, we see after elementary column operations that the 
determinant in (|3.12|) is equal to 



det 



n 



(n- 1)! 



det 



l\{n-k)\ 
But the determinant in (|3.13|) can be written 



j,k—l,...,n 

t 



1 - 1 



dt 



j,A;— l,...,n 



(3.13) 



dti 



dtn [|e-^'^^*^ (l -tj)"-Met 

= dh--- dtnH e-^'-^*^ l[(tk - t,) det 
^0 -^0 ^.^1 



f \ fe-1 



1-i, 



j.k—l....,n 



j,k—l,...,n 



(3.14) 



Substituting (|3.14|l in H3.13|l . then substituting the result in (|3.12|l reclaims l|2.7(l in the case m = n. 

It remains to apply the limiting procedure discussed in the paragraph below H2.1()(l to deduce from 
H3.11I) the evaluation for general m < n. Noting the asymptotic expansion 



,1 - ti-ir' '"'";,,'";"^" 

p—1 \ J / 



the required hmits can be taken to give 



71—1 ^ n — m — l -p/ X /TT"^ /TT^ \ ^ 



X det 



[/„ (1 -i)"^^e-^'-^-^'*dt],=i.....™ 



(3.15) 
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